Unsupervised Cross-Adaptation Approach for Speech Recognition by Combined Language Model and Acoustic Model Adaptation

Authors

  • Tetsuo Kosaka
  • Taro Miyamoto
  • Masaharu Kato
Abstract

The aim of this study is to improve speech recognition performance by combining language model (LM) adaptation with acoustic model (AM) adaptation. The proposed adaptation techniques are based on cross-system adaptation or cross-validation (CV) adaptation. The principle is to use complementary information derived from several systems or data sets. Because language information and acoustic information differ completely, the combined approach is expected to be effective. We evaluate the proposed methods in speech recognition experiments on the Corpus of Spontaneous Japanese (CSJ). Both cross-system adaptation and CV adaptation outperform the conventional adaptation method; the cross-system adaptation method gave the best recognition performance.
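
The abstract does not spell out the adaptation procedure, so the following is only a minimal sketch of the general K-fold idea behind CV adaptation, not the authors' implementation. It assumes that each utterance is decoded with a model adapted only on hypotheses from the other folds, so that an utterance's own recognition errors are not fed back into the model that re-decodes it. The names `split_folds`, `cv_adaptation_pass`, `adapt`, and `decode` are hypothetical and stand in for whatever LM or AM adaptation and decoding routines a real system provides.

```python
from typing import Any, Callable, Dict, List, Sequence, Tuple


def split_folds(utt_ids: Sequence[str], k: int) -> List[List[str]]:
    """Split utterance IDs into k roughly equal folds (round-robin)."""
    if k < 2:
        raise ValueError("k must be at least 2")
    return [list(utt_ids[i::k]) for i in range(k)]


def cv_adaptation_pass(
    utt_ids: Sequence[str],
    hypotheses: Dict[str, str],
    base_model: Any,
    adapt: Callable[[Any, List[Tuple[str, str]]], Any],
    decode: Callable[[Any, str], str],
    k: int,
) -> Dict[str, str]:
    """Run one unsupervised CV-style adaptation pass (illustrative only).

    For each fold, the base model is adapted on the (utterance, hypothesis)
    pairs of all OTHER folds, and the adapted model re-decodes the held-out
    fold. `adapt` and `decode` are placeholders supplied by the caller.
    """
    new_hypotheses: Dict[str, str] = {}
    for held_out in split_folds(utt_ids, k):
        held = set(held_out)
        # Adaptation data excludes the fold that is about to be decoded.
        adaptation_data = [(u, hypotheses[u]) for u in utt_ids if u not in held]
        model = adapt(base_model, adaptation_data)
        for u in held_out:
            new_hypotheses[u] = decode(model, u)
    return new_hypotheses
```

In a combined setting, one could call such a pass once with LM adaptation routines and once with AM adaptation routines, but how the two are interleaved in the paper is not stated in the abstract.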

Similar resources

Unsupervised Language Model Adaptation Using Word Classes for Spontaneous Speech Recognition

This paper proposes an unsupervised, batch-type, class-based language model adaptation method for spontaneous speech recognition. The word classes are automatically determined by maximizing the average mutual information between the classes using a training set. A class-based language model is built based on recognition hypotheses obtained using a general word-based language model, and linearly...

Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation

The aim of this work is to improve the recognition performance of spontaneous speech. In order to achieve the purpose, the authors of this chapter propose new approaches of unsupervised adaptation for spontaneous speech and evaluate the methods by using diagonal-covariance and full-covariance hidden Markov models. In the adaptation procedure, both methods of language model (LM) adaptation and a...

Online Unsupervised Multilingual Acoustic Model Adaptation for Nonnative ASR

Automatic speech recognition (ASR) is currently one of the main research interests in computer science. Hence, many ASR systems are available in the market. Yet, the performance of speech and language recognition systems is poor on nonnative speech. The challenge for nonnative speech recognition is to maximize the accuracy of a speech recognition system when only a small amount of nonnative dat...

Unsupervised Language and Acoustic Model Adaptation for Cross Domain Portability

This work investigates the task of porting a broadcast news recognition system to a conversational speech domain, for which only untranscribed acoustic data are available. An iterative adaptation procedure is proposed that alternately generates automatic speech transcriptions and performs acoustic and language model adaptation. The procedure was applied on a tourist-information conversational...

Live speech recognition in sports games by adaptation of acoustic model and language model

This paper proposes a method to automatically extract keywords from baseball radio speech through LVCSR for highlight scene retrieval. For robust recognition, we employed acoustic and language model adaptation. In acoustic model adaptation, supervised and unsupervised adaptations were carried out using MLLR+MAP. With this two-level adaptation, word accuracy was improved by 28%. In language model ...

Publication date: 2011